Capacity Releasing Diffusion for Speed and Locality

نویسندگان

  • Di Wang
  • Kimon Fountoulakis
  • Monika Henzinger
  • Michael W. Mahoney
  • Satish Rao
چکیده

Diffusions and related random walk procedures are of central importance in many areas of machine learning, data analysis, and applied mathematics. Because they spread mass agnostically at each step in an iterative manner, they can sometimes spread mass “too aggressively,” thereby failing to find the “right” clusters. We introduce a novel Capacity Releasing Diffusion (CRD) Process, which is both faster and stays more local than the classical spectral diffusion process. As an application, we use our CRD Process to develop an improved local algorithm for graph clustering. Our local graph clustering method can find local clusters in a model of clustering where one begins the CRD Process in a cluster whose vertices are connected better internally than externally by an O(log n) factor, where n is the number of nodes in the cluster. Thus, our CRD Process is the first local graph clustering algorithm that is not subject to the well-known quadratic Cheeger barrier. Our result requires a certain smoothness condition, which we expect to be an artifact of our analysis. Our empirical evaluation demonstrates improved results, in particular for realistic social graphs where there are moderately good—but not very good—clusters.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Capacity Releasing Diffusion for Speed and Locality

Note an ineligible arc (v, u) must remain ineligible until the next relabel of v, so we only need to check each arc out of v once between consecutive relabels. We use current(v) to keep track of the arcs out of v that we have checked since the last relabel of v. We always pick an active vertex v with the lowest label. Then for any eligible arc (v, u), we know m(u) ≤ d(u), so we can push at leas...

متن کامل

When High-Capacity Readers Slow Down and Low-Capacity Readers Speed Up: Working Memory and Locality Effects

We examined the effects of argument-head distance in SVO and SOV languages (Spanish and German), while taking into account readers' working memory capacity and controlling for expectation (Levy, 2008) and other factors. We predicted only locality effects, that is, a slowdown produced by increased dependency distance (Gibson, 2000; Lewis and Vasishth, 2005). Furthermore, we expected stronger loc...

متن کامل

Simulating the Effects of Type and Spacing of Traffic Calming Measures on Urban Road Capacity

One of the major reasons for accidents is speed. Top of Form Inappropriate speed has been identified as the most important causal factor for serious traffic accidents. Traffic calming measures (TCMs) are engineering measures that are widely implemented to improve road safety by considerably reducing vehicle speed. TCMs have been widely used in urban areas to reduce vehicle flow rat...

متن کامل

Application of Magnetic Polymer Particles Modified with β–Cyclodextrin for Adsorption of Bovine Serum Albumin

Magnetic polymer particles which were modified by vinyl groups and subjected topolymerization by vinyl β-Cyclodextrin derivative, has been used as adsorbent for sorption andrelease of bovine serum albumin and the equilibrium and kinetics of the adsorption process werestudied. The absorbability and releasing of this protein through the new polymer has beenmeasured by ultraviolet-visible spectros...

متن کامل

Study on diffusion coefficient of benzene and ethyl benzene vapours in nanoporous silica aerogel and silica aerogel-activated carbon composites

In this study, nanoporous silica aerogel and silica aerogel-activated carbon composites have been synthesized using a water glass precursor by cost effective ambient pressure drying method. Equilibrium and kinetics of benzene and ethyl benzene adsorption on silica aerogel and its composites have been measured in a batch mode at tree weights of adsorbent. For the first time, the experimental dat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017